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Abstract. We give a new algorithm using linear approximation and lat- 
tice reduction to efficiently calculate all rational points of small height 
near a given plane curve C. For instance, when C is the Fermat cubic, we 
find all integer solutions of + - < M with < x <y < z < N 
in heuristic time ^ (log"^'^' A^)A/ provided M ^ N, using only 0(log A'') 
space. Since the number of solutions should be asymptotically propor- 
tional to Mlog A (as long as M < N'^), the computational costs are es- 
sentially as low as possible. Moreover the algorithm readily parallelizes. 
It not only yields new numerical examples but leads to theoretical re- 
sults, difficult open questions, and natural generalizations. We also adapt 
our algorithm to investigate Hall's conjecture: we find all integer solu- 
tions of < - y^l < x^l'^ with a; < A in time 0{X^I'^ log^'^'A). By 
implementing this algorithm with A — 10^* we shattered the previous 
record for x^^^ /\x^ - The 0(A^''^ log°^^^A) bound is rigorous; its 
proof also yields new estimates on the distribution mod 1 of [cx)'^^^ for 
any positive rational c. 



1 Introduction 

One intriguing class of Diophantine problem concerns small values of homo- 
geneous polynomials. In the simplest nontrivial case of a polynomial in three 
variables defining a projective plane curve C : P{X, Y, Z) = 0, the problem can 
be reformulated thus: given a plane curve C, describe for each positive N, 6 the 
rational points of height at most N in which are at distance at most i5 of C. 
With present-day methods, hardly any nontrivial results can be proved on the 
number or existence of such points. But one can still seek numerical evidence, 
and efficient algorithms for obtaining this evidence. The direct approach is to try 
all X, y with |x|, \y\ < N, and for each pair to solve P{x, y,z) = for z e [—N, N], 
recording those cases in which z is sufficiently close to an integer. This requires 
space 0(log(A^)) but time {N'^ + SN^) log^^^'' N , which is inefficient once d is 
much smaller than since for general C,N,5 3> N"^ the number of solu- 
tions should be proportional to 5N^. We give a new algorithm, also requiring 
only 0(log(A^)) space, but with heuristic running time (N+SN^) log'^^^^N. Thus 
as long as (5 3> we expect to find all the points of height < A'^ and distance 
< 5 in time only log'^^^^A'^ per point. Moreover, our method readily parallelizes, 
since it divides the computation into many independent subproblems. 
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We describe this algorithm, give the heuristic estimate for its run time, and 
briefly discuss the problem, which seems quite difficult, of proving our heuristic 
time estimates. We prove (Thm.^ that an alternative description of those points 
can always be computed in the heuristically expected time. We then discuss 
natural generalizations to other valuations and higher dimensions. 

An algorithm for finding rational points near a variety can in particular 
find rational points on the variety; applying our methods to embeddings of the 
variety in projective spaces of high dimension we obtain a new approach to 
this fundamental problem in computational number theory which improves on 
existing methods in several important cases. This approach also works for non- 
algebraic varieties, and even yields a theoretical result (ThmJJ) on the paucity 
of rational points on non-algebraic analytic curves. 

We next describe experimental results of the implementation of our algorithm 
to various curves of interest, notably the Fermat curves of degree n > 2, where 
some of our experimental findings led us to new polynomial families of small 
values of jz" — y" — x"! (Thm.|^). We devote a separate section to the case 
of the cubic Fermat curve, corresponding to small values of — — x^\, a 
problem for which there is already some literature and the heuristics are subtler. 
In particular, we found for several integers d < 10'^ the first representation of d 
as a sum of three integer cubes; D.J.Bernstein has since extended the search up 
to A'^ = 2 • 10^ and beyond, and found many new solutions, including one for 
d = 30 which was a long-standing open problem. 

Finally we show how to modify our algorithm to efficiently search for small 
nonzero values of |a;'^ — y^|. This is the topic of Hall's conjecture, which is part 
of a web of important Diophantine problems surrounding the ABC conjecture 
of Masser and Oesterle. The conjecture asserts that — is either zero or 
»e x^/^'^ for all x,y £ Z. We are able to find all solutions of < Ix^ — y"^] x^l"^ 
with a; < A in time 0(A^/^ log'^'^^-'A), again using only 0(log A) space. Using 
this improvement on the obvious Alog'^^^-'A' method of trying all a; < A, we 
computed all cases of < — y'^\ < x'^l'^ with A < 10^^. We found ten new 
solutions, including most notably 

5853886516781223^ - 447884928428402042307918^ = 1641843 

with x^l"^ j\x^ — y^l = 46.600+, improving the previous record by a factor of 
almost 10. In this case the time estimate is not heuristic; its proof not only 
streamlined the computation but even yields new theorems on the distribution 
mod 1 of (cx)^/^ for any positive rational c. We announce some of these results 
at the end of the present paper; the full statements and proofs will appear 
elsewhere. 
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2 The algorithm in theory 

2.1 Specification and heuristic analysis 

While we are mainly interested in algebraic plane curves C, the algorithm does 
not require so strong a hypothesis: we can find[| 2063'^ + 8093^ - 8128^ = 
0.019369- as weh as 386692^ + 411413^ = (1 - 1-035 . . . • 10-1^)441849^. All 
we need is that C is the image of a differentiable map (f> : [0, 1] RP^ with 
bounded second derivatives. Fix a positive S < 1, and assume 5 ^ N^'^ for rea- 
sons given in the next paragraph. Partition [0, 1] into 0{d^^^^) intervals Im each 
of length \Im\ = 0{S^^^). On each /„, approximate to within 0(|/mp) = 0{S) 
by a linear approximation 0. Then a point at distance < d from (j){Im) remains 
at distance ^ 6 from (f>{Im)- 

We now treat each Im independently. The triples {x, y, z) ^7? — {0} such 
that (a: : y : z) G has height < N and is within 0{S) of 4){lm) are among 
the nonzero integer points in a parallelepiped Pm of height, length and width 
proportional to N,S^^'^N,6N. Thus we expect that \Pm H Z'^j is approximately 
the volume of P„j, provided that this volume is ^ 1. This is the case once 
6 ^ N^'^. (That is why we insisted that S ^ N^'^: choosing smaller S would 
only make us work at least as hard to find fewer points.) Listing all the points in 
Pm n Z'^ is a standard application of lattice reduction. Let Mm be an invertible 

^ Our computations indicate that the first example is probably the smallest value of 
+y'^ — for positive integers x,y,z, and at any rate the smallest with z < 10^; 
and the second is the smallest ratio oi \x'' + y''^ — z""^] to 2^, and even to z**, for positive 
integers satisfying x < y < z < 10^. See the next section. 
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3x3 matrix such that MmPm is the cube K ~ [—1, l]'^. We are then seeking 
all u € Z'^ such that MmV € K , or equivalently all vectors iri K C] M^^Ti^ . We 
find them by reducing the lattice M~^Z^. This gives us a matrix Lm € GL3(Z) 
such that M„iL„i is small. Now MmV d K \i and only if w € Z'^ n {MmL„i)~^K 
where v = L,nW. But {M„iL„i)^^K is contained in the box centered on the origin 
whose i-th side is twice the norm of the i-th row of {MmLm)~^ (« = 1, 2, 3). For 
each nonzero integral w in this box, calculate (x, ?/, z) = LmW and test whether 
(x : y : z) in fact has height < N and lies within S oi C. Doing this for each m 
yields the full list of such points. 

As advertised, the algorithm requires only 0(logA'^) space (though much 
more space is usually needed to store the results of the computation). Also, 
since each of many intervals Im is treated independently, the computation can 
be massively parallelized with little loss among processors that interact only by 
reporting each {x : y : z) to headquarters as it is found. How long do we expect 
the computation to take? We assume that and its derivatives can be calculated 
to within A^~'^(i) in time log*^^^-* A^. Such is the case for all curves we consider and 
for every algebraic plane curve. Then each M„i takes only log'^^^^A^ operations 
to compute. Each lattice reduction can also be done in time polynomial in \ogN, 
since our lattices are in fixed dimension — and moreover our dimension of 3 is 
small enough that Minkowski reduction is described explicitly. [For an overview 
and further references concerning Minkowski reduction, see [CS, pp. 396-7].] So 
far this amounts to <C A^ time up to the usual log factors. Now each 

Prn has volume 23/|detM„| < (d'^/'^N)^. If each (M^i™)"! had all of its 
entries 0{S^^'^N) — equivalently, if the shortest nonzero vectors of each lattice 
M-^Z^ had length > S^^^^/N — then there would only be 0{{S^^^N)^) choices 
for w, which summed over m gives 0{SN^). Thus the total work would indeed 
be log'^^^^A^ times the expected number of solutions. Unfortunately it is too 
optimistic to expect that the entries of {MmLm)~^ are all <C S^^^N. If the 
lattices M^^Z"^ are randomly distributed in the space of lattices of covolume 
(5^/^/A^)^ in R'^, some of them will have nonzero vectors much shorter than 
5~'^/'^ /N . However, the average number of lattice vectors in K of a random 
lattice of determinant D is still 0{1/ D). Thus we expect — and typically find in 
practice — that, even accounting for the occasional short lattice vector, we will 
find all rational points of height < A^ that lie within 6 of C, doing on average 
log'^^^-'A^ work per point. 



2.2 Can the estimates be made rigorous? 

Our assumption that the lattices M:^^Z^ are randomly distributed was not 
proved; indeed it is false at least for some choices of C. Most glaringly, if C 
is a rational straight line then there are ^ A^^ rational points on C, and a for- 
tiori at least as many at distance < 5. While we of course will not apply our 
algorithm to straight lines, we do apply it to the n-th Fermat curve, which has 
contact of order n with several rational lines such as y = z; each of those lines 
contains ^ iV^"^/" points at distance ^ 1/A^^ from the curve, exceeding the 
expected count of A^log'^'-^-'A^ once n > 2. (These are the points we exclude by 
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imposing the inequality y<z in 0<x<y<z< N.) Assume, then, that C 
has at most finitely many tangent lines which have contact of order > 2 with C, 
and for any (5 > let Cs be the curve consisting of points of C at distance > i5 
from each of those higher-order tangent lines. For each point P on Cg we obtain 
a lattice Ls{P) C whose nonzero short vectors correspond to points near P 
in P^(Q), of height (5~^/^, lying at distance <C 5 from Cj. This gives a map 
Ai from Cs to the moduli space of lattices in R^. We would thus like to ask: as 
5^0, does the image of As become uniformly distributed in this moduli space? 

Th(TC are several problems with this formulation of our question. A minor 
one is that we have not defined As precisely enough for the question to make 
sense, because we have left some 0-constants unspecified. This did not matter 
for qualitative properties such as whether the lattice has 0(1) short vectors, but 
makes it easy to frustrate uniform distribution by simply choosing As to avoid 
a small region in the moduli spac;c. This problem is easy enough to fix for any 
given C; for instance, if C is given by x (x : y{x) : 1) for some differentiable 
function y : [0, 1] [—1, 1] with bounded second derivatives, we may take for 
As{x) the integer span of the columns of 

^ \ 

1 -X . (1) 
-y'/S 1/S {xy'-y)/5j 

But this brings us to a more serious difficulty. The question of whether As{Cs) is 
asymptotically uniformly distributed as 5 —» is likely to be a very hard problem 
in analytic number theory. For our purposes we are only concerned with how 
often and how close does As{P) come near the cusp of the moduli space. For 
instance, we see in the final section that if C is a conic then As{Cs) is restricted 
to a surface in the moduli space of lattices in R^, but within that surface it still 
approaches the cusp rarely enough that the average number of short vectors in 
a lattice in As{C) is still <C log(l/(5). In general, then, what we would like is 
the following result: as S ^ 0, the average number of vectors of norm < 1 of a 
lattice in As{Cs) is < log°(^)(l/5). 

This still looks like a very difficult problem. While it remains open, we pro- 
pose a contingency plan in case the lattices Ls{P) have many more short vectors 
than expected. If all the short vectors are multiples of a single vector of small 
norm, there is no difficulty, because all these multiples yield the same point 
in P^. But there could be two independent short vectors, whose linear combina- 
tions yield a line in containing many points of small height near C . We claim 
that this is in fact the only way that a lattice of covolume <C 1 could have more 
than 0(1) short vectors. This claim is easy enough to check using the descrip- 
tion of Minkowski-rcducod lattices in R'^, but we shall later need a generalization 
to lattices in higher dimension. We thus state and prove the generalization as 
follows: 

Lemma 1. For each positive integer n and positive realt there exists an effective 
constant Mn{t) such that the following hound holds: for any lattice A c R" whose 
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dual lattice A* has no nonzero vector of length < r, and for any R > 0, there 
are at most M„(r_R) r^"|yl|^^ vectors of length < R in A. 

Here is the covolume Vol(R"/yl). The lemma can be obtained as a con- 
sequence of the theory of lattice reduction, but it is not easy to extract M„(t) 
ex plici tly this way. We thus give the following alternative proof in the spirit 
of [CI I from which explicit (albeit far from optimal) bounds M„(t) may be eas- 
ily computed if desired. 

Proof. Given n, choose a positive Schwartz function / : R" —>■ R with the 
following properties: / is radial, i.e. f{x) depends only on |x|; and the Fourier 
transform / : R" R, defined for y E R" by 

f{y) / /(x)e2-(-'^)dx, (2) 

satisfies f{y) < for all y such that \y\ > 1. For instance, we may take 

/(x) = (|a;|2+a)e--^l-l' (3) 

where 

2n 1 n , . 

because the Fourier transform of a function (||) is 

|2 

2t:c c 



for any c > and a G R. By Poisson summation, 

E/(™) = ;4tE/V0. (6) 

Under the hypothesis on r, the only positive term in the sum over y is /(O). The 
sum over x is bounded from below by the sum over x of length < R, which is at 
least the number of such vectors times min|^|<^ /(rx). It follows that A has at 
most 

/(O) 



r"|yl|min|j,|<rfl/(a;) 
vectors of length < R, as claimed. 



^M^irR)r-^\A\-' (7) 



Corollary 1. For each positive integer n there exists an effective constant An 
such that if a lattice A C R" has more than AnR" /\A\ vectors of length < R for 
some R > then all those vectors lie in a hyperplane, which can be computed in 
polynomial time. 
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Proof. Except for the last phrase, this follows from the previous Lemma by 
taking r = 1/R and A„ = M„(l), since then A* must have a nonzero y of length 
at most r, and any vector of A of length < R must be orthogonal to y. To assure 
that y can be computed in polynomial time, we take r = c/i? for a positive 
constant c small enough that if A* has a nonzero vector of length at most c/R 
then the LLL algorithm will find a (possibly different) nonzero vector of length 
at most 1/R. Our Corollary now holds with An = c~"Mn{c). 

From the case n = 3 of this Corollary we deduce: 

Theorem 1. Let C be the image of a differentiahle map (j) : [0,1] RP^ 
with bounded second derivatives. Then for each N > 1 and 6 > N~'^ one can 
find 0{SN^) rational points and 0{N) rational line segments each of length 
0(1/ N) in which together include all rational points of height < N at dis- 
tance < 6 from C. These points and line segments can be computed in time 
« SN^log'^'^^^N. Outside of 0{dN^ log N) space used only to record each point 
or segment as it is found, the computation requires space <C log''^^^ A?'. All implied 
constants depend effectively on C. 

Note that here we do not exclude neighborhoods of high-order rational tan- 
gents to C; such tangents will contain some of the lines segments computed by 
the algorithm. 

2.3 Variations and generalizations 

The problem of finding rational points near plane curves is only the first non- 
trivial example of many analogous problems to which our method can apply. We 
briefly discuss some of these here. 

One easy variation is to change the norm: instead of approximating the curve 
in the real valuation, use a nonarchimedean one, or a combination of several. For 
instance, one can efficiently seek nontrivial triples of small integers the sum of 
whose cubes is divisible by a high power of 2 or of 10. Likewise one can replace 
Z by Fq[T] or similar rings in function fields of positive genus. The lattice- 
reduction step should then be even easier than in the archimedean case, though 
in the function-field setting our approach faces strong competition from the 
method of undetermined coefficients, and it is not clear which is superior. All 
these comments apply equally to the adaptation of our method to the problem of 
finding small nonzero values of — provided the characteristic is not 2 or 3. 
For the — problem, the work estimates are again rigorous; otherwise, they 
are still heuristic, but their analysis may be more tractable in the function-field 
case. 

Higher dimensions present many new opportunities. The easiest generaliza- 
tion is to a hypersurface in P*^"^. Here we are seeking small values of a 
homogeneous function of k variables evaluated at an integral point. This time 
we chop the hypersurface into 0{6~'^'^~'^^^^) chunks each of diameter 0{S^^^), and 
replace each chunk by a subset of a hyperplane which approximates it to within 
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0{6). The points of height < N that are within 0{S) of this chunk then come 
from integral points in a parallelepiped in R'^ whose sides have lengths 0{N), 
0{N5^/'^) {k — 2 times), and 0{NS). Again most of these this parallelepipeds 
have volume 3> 1 provided 6 ^ N^'^, and we locate the integral points using 
lattice reduction in R*^. So, as long a,s 6 ^ N^'^, we expect to find on the order 
of SN^ points, using ^ log'^'^^^^A^ space and spending <C log*^*"*-^^ time per 
point. For a general hypersurface, this again improves on other approaches to 
the problem. But the improvement decreases with k: the direct approach takes 
time iV'^-Mog'^^^^A^, and we lower the exponent by a factor no better than 
(fc — 2)/(fc— 1), which approaches 1 as fc ^ oo. Moreover, lattice reduction in R'^ 
quickly becomes difficult as k grows. Another consideration is that for special 
surfaces there are known, and simpler, algorithms that take time A^'''~^ log'^'-^-'A^ 
or less once k > 3 . For instance, for Fermat surfaces in P'^, one readily adapts 
the method of |B1| to find all solutions of + — zL + 0{z^~'^) in posi- 
tive integers with t < z < A^, in expected time A^^ log*^*-^-* A^, and with no need 
for lattice reduction in R^ or other complicated ingredients. This computation 
does require space proportional to NlogN, which however poses no difficulty 
for practical values of A^. As in the previous paragraph, all that is described in 
the present paragraph can be done also for a nonarchimedean norm, with similar 
results except that lattice reduction over a function field is tractable even for 
large k. In either case rigorous estimates may become even less accessible as k 
grows. 



We can generalize further to manifolds A4 C P'^~^ of codimension c > I. 
Here we expect to find on the order of ^'^A^'^ rational points of height < A^ at 
distance 0{S) from Ai. We chop A4 into 0((5~^'^~^~'^-'/^) patches of diameter 
0(5^/^), each of which yields a parallelepiped in R''' with dimensions of order N 
(once), NS (d times), and NS^^"^ (the remaining fc — 1 — c dimensions). We 
thus expect to efficiently find all ~ points as long as ^ ]\[-^k/{k+c-i) _ 

A further possibility emerges if Ai has bounded derivatives past the second 
derivatives and has small enough dimension compared with k: we can then make 
further headway when S falls below that threshold. Usually we are only interested 
in points much closer than but as long as we use only the 

structure we gain nothing by making S even smaller, so we may as well find all 
the points at distance 0(A^^2fc/(fe+c-i)^ ^^^^^ locate the best approximations in 
the resulting list. However, if Ai is and its dimension ci = fc — I — c is so small 
that k > ('^^^) , then a patch of diameter e is contained in a box with d sides of 
length ^ e, a further {d'^ + d)/2 sides of length <^ e^, and the remaining k—(^~^'^) 
sides of length <C e'^. This means that we can makes our parallelepipeds thinner 
in some directions, and thus use wider patches of A^, covering the entire manifold 
with fewer of them. This lets us locate the points of height < A^ closest to A4 
in time significantly less than it would take to record all the points at distance 
<^ jY-2fc/(fc+c-i)^ even though not so efficiently that we only spend <C log'^'^^^^A^ 
time per point. More generally if is a C manifold we can exploit bounds on 
the i-th derivatives once k > {"^^^l^)- 
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If the ambient projective space is not of high enough dimension, we can still 
make some use of approximations to A4 of degree i > 1 by using the i-th Veronese 
embedding Vi of P'^^i into projective space of dimension C^^p^) — 1- [The i-th 
Veronese embedding takes the point with projective coordinates {Xi : • • • : Xk) 
to the point whose projective coordinates are all C^^p^) monomials of degree i 
in the Xj. Thus Vi raises all heights to the power i, and transforms intersections 
with hypersurfaces of degree d in p^~^ into hyperplane sections in a projective 
space of much higher dimension. For more on Veronese embeddings, see for 
instance fFH|, where they arise several times.] The idea is to surround each 



patch of Vi(A^) by a box containing all points in 1^(P'^~^) at distance 0{5) from 
Vi{A4). The resulting asymptotic improvement may be only barely worth it in 
practice, though. Consider the simplest case of a curve C € P^, embedded 
in P^ by V2. Assume for simplicity that the parametrization (f) oi C has \(f>"\ 
bounded away from zero. Then, for 6 such that <^ 6 ^ e^, the radius-(5 
neighborhood in P^ of an interval of length e on C maps into a box in P^ whose 
sides are of order e,€^,6,e^,e'^. [To see this, choose coordinates {Xq : Xi : X2) 
on P^ for which (p is of the form {1 : t : t'^ + 0{t'^)) for t in a neighborhood of 0, 
and note that V2 takes {Xq : Xi : X2) to {Xg : XqXi : : X^ : X1X2 : 

Thus the points of height at most N in that neighborhood map to lattice points 
in a 6-dimensional parallelepiped of volume ^ SN-^'^e'^'^. (Here N occurs to the 
power 12 rather than 6 because V2 squares the height of each rational point.) 
Thus if we take e = ((5iV^^)~^/^° we expect to find all points at distance ^ S 
from C, of which there should be about SN^, in time N{SN^y/^° \og^^^'>N. The 
condition S ^ yields S ^ 7V^36/i3^ save a factor of at most N-^^^^. We 

pay not only by missing the points at distance between N~^^/^^ and (which 
usually do not interest us anyway) but also by reducing lattices of rank 6 rather 
than 3. This takes more time per lattice, and probably yields parallelepipeds 
whose average bounding box is larger. Each of these effects amounts to only a 
constant factor, but these factors may be considerable, and it will be interesting 
to see how large N must be for this use of V2 to be practical. 



2.4 Rational points on varieties 

In the last paragraph we exploited the fact that points near M map under Vi to 
points that are not only near Vi{M) but exactly on Vi(P'''^^). We can go much 
further when we search for points exactly on M . Again we consider the simplest 
case of a curve. We begin with a curve in one projective space: 

Theorem 2. Let C be an algebraic curve in A'l -dimensional projective space, 
defined over Q and not contained in any hyperplane. Then for any N > I the ra- 
tional points of C of height at most N can be listed m time <c N'^^^'' log°"'^'iV. 
The implied constants depend effectively on d and C . 

Remarks. As seen above for M = 2, this result applies more generally to a 
qM Q^j-ve in P*^ whose intersection with any hyperplane can be computed in 
polynomial time. The exponent 2/M is best possible: a rational normal curve 
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of degree M (a.k.a. the image of under Vm) has on the order of N"^/^ 
rational points of height at most N, and it takes time 3> N'^/'^logN just to 
write them down. The constant imphed in Od(l) and/or while effective, 
may be unpleasant in practice for large M, since lattice reduction in dimension 
M + 1 is involved. 

Proof. A segment of C of length ^ TV^^/a/ contained in a box whose i-th side is 
< Ar-2i/Ji^ (i = 1, 2, . . . , Af ). The rational points of height at most N in this box 
come from points of Z^^+^ contained in a box B whose i-th side is ^ jSf^-"^"^/^ 
(i = 0, 1,2, ...,M) and thus has volume 0(1). It takes time <c log°*'^^^A^ 
to apply lattice reduction and, by Corollary |l|, either list Z*^+^ n B or find a 
hyperplane containing 7*^+-'^ n B. In the former case, we test whether each of 
the resulting Od(l) points lies in C. In the latter case, we map this hyperplane 
to P^^ and intersect it with C, finding at most deg(C) — 0(7(1) rational points. 
Thus in either case we find all rational points of height < TV on our segment in 
time <Cc log*^"^^^ A'^. Since it takes only 0{N'^^^^) segments to cover C, we are 
done. 



It might seem that this algorithm is supcrfiuous: if C has genus then its 
small rational points may be found directly from a rational parametrization, 
without any lattice reduction; and if C has positive genus then we can find all 
its points of height < in time ^ log'^'-^-'A^ once we have generators of the 
Mordell-Weil group of the Jacobian of C. But the difficulty is that we must 
first find these generators, and this requires locating rational points on a curve 
or a higher-dimensional variety. For instance, to find the Mordell-Weil group of 
an elliptic curve E we usually apply a few descents and then search for points 
on certain principal homogeneous spaces for each of which is a curve C of 
genus 1, usually (in the case of a complete 2-descent) of the form y"^ = P{x) for 
some irreducible quartic P G Z[X]. One then searches for a: S Q of height up 
to H for which P{x) G Q^. There are on the order of candidates for x\ one 
can set up a sieve to efficiently try them all, but this still takes time \og'~^^^^H 
(and significant space) . Instead we can embed C in P^ as the intersection of two 
quadrics (by writing P{x) as a homogeneous quadric in l,x,x'^), and use the 
algorithm of Thm.^ with A^ = to find all rational solutions of = P{x) 
with X of height < iJ in time H'^/^ log^'^^^H. For certain E one can use Heegner 



points to locate a rational point on C to within S (see [E3 ); if (5 <C 7J , this 



is sufficient to identify x using continued fractions, a.k.a. lattice reduction in 
dimension 2. Using the new algorithm, we see that S ^ H^^/^ suffices if we use 
lattice reduction in dimension 4. This saves a constant factor in the computation 
of X, since fewer digits and terms are needed in the fioating-point computation 
of Heegner points. When C has genus > 1, there are only finitely many rational 
points by Faltings' theorem, but they still may be of significant number and/or 
height. For instance, in [KK,|] one finds curves C : y^ — P{x) of genus 2 which 
have hundreds of rational points. In both cases, all points with x of height < 10^ 
were found using the iJ2 logC'-^^H sieve method, a substantial computation. At 
least in the case considered in Q , where the Jacobian of C is absolutely simple 
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with large Mordell-Weil rank, it would probably be even more onerous to find 
all these points by first determining the Mordell-Weil group. But the embedding 
(1 : X : : : y) of C into yields an improvement from to H^^^ with 
5-dimensional lattice reduction. 

We can do even better by mapping the same curve to larger projective spaces. 
Fix an algebraic curve C of genus g defined over Q, and a divisor on C of 
degree d > 0. For n sufficiently large, the sections of nD embed C into P"'='-ff. 
This embedding sends any rational point on C of height (exponential, as usual 
here) < H relative to £> to a point on P"''-^ of height <C -ff". By Thmj| again, 
we can find all such points in time <C i/2"/(»<i-9) log'^'^-'^^iJ. Letting n oo, we 
conclude: 

Theorem 3. Fix an algebraic curve C/Q and a divisor D on C of degree d > 0. 
For each e > there exists an effectively computable constant such that for 
any H > 1 one can find all points of C whose height relative to D is at most H 
in time 

For instance, all rational points on = P{x) with x of height at most H 
can be computed in time 

What of varieties M. of dimension A > 1 in P*^? A chunk of radius 5 
then yields the intersection of Z*^+^ with a box with sides as follows: one of 
length 0{N), A sides of length 0[N6), {'^+^) sides of length OiNS"^), 
('^+;-^) sides of length 0{N5'), . . . until \^+') = J^^^^ i"^^''^) first exceeds 
M. As usual we choose S so that the product of these sides is 1, and apply lattice 
reduction to each of 0((5^'^) chunks. The difficulty here is that if the lattice is 
nearly degenerate, the hyperplane found in Corollary |^ meets Ai not in a finite 
number of points but in a subvariety of positive dimension A — 1. This suggests 
an induction on A, since we can apply our method to that hyperplane section 
of A4. But already for A = 2 such an argument requires a version of Thm.H 
with more uniformity in the implied constants than we know how to obtain. 
However, as with our first nontrivial case of curves in P^, we do not expect such 
degenerate lattices to arise in practice often enough to raise the computational 
cost above 0(5~''^ log'^^^-'iV), except for a finite number of proper subvarieties 
of A1. If we assume this, we can again obtain better estimates by embedding 
A4 in larger projective spaces. Fix an ample divisor D on A4, and ask for all 
rational points whose height relative to A4 is at most H. Using the sections of 
nD to embed A4 in projective spaces, and letting n ^ oo, we find the following 
heuristic generalization of Thm.^: for each e > 0, there exists a proper subvariety 
Aio{e) of A4 such that all points of — M.o{e) of height at most H relative 
to D can be found in time 

0,(iJ«^+i)/l^l)+^), (8) 

where \D\ is the Z\-th root of the intersection number D^. One might even hope 
that Mo{e) can be taken independent of e. For instance, if is a surface of 
degree d in P^ then we expect that, for some union A^o of curves on AJ, we 
can find all rational points of height < N on M ~ Mo in time A^('^/^)+'^. 
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We must admit that this is unlikely to yield a practical improvement over the 
7V2iogO(i)7V method we already knew: the first Vi that reduces the exponent 
of N below 2 is V3, and then (assuming d > A) the exponent drops only to 24/13 
— but instead of reducing 4-dimensional lattices we are then faced with lattice 
reduction in dimension 20. It will probably be a long time before N can feasibly 
be taken large enough that this extra effort is worth the N^/^^ factor gained. 

Returning to plane curves, we can use this idea to prove an even stronger 
bound on rational points on a plane curve C that is analytic but not algebraic. 
This is because the homogeneous monomials of degree i in the coordinates of C 
are linearly independent for each i, so Vi{C) spans a projective space whose 
dimension grows quadratically in i (whereas for an algebraic curve the growth 
is always linear). This leads us to the following result: 

Theorem 4. Let C be a transcendental analytic arc in P^, i.e. C — {f{x) : 
a < X < b} where f is an analytic map from a neighborhood of [a, b] to 
whose image is contained in no algebraic curve. Then for each e > there exists 
a constant such that for every H < 1 there are fewer than A^H'^ points of 
height < H in C f] P'^ (Q) . 

Proof. For each positive integer i consider Vi{C) C p(*^+30/2^ Since C is tran- 
scendental, Vi{C) is an analytic arc Viof contained in no hyperplane of P*^* +3i)/2^ 
Now apply the argument for Thmj2|with N = H^. As noted in the remarks fol- 
lowing the statement of that theorem, the curve need not be algebraic as long 
as it is C^^ and its intersection with any hyperplane is of bounded size. (Here 
we need not compute this intersection numerically, since we are only bound- 
ing the number of rational points of small height on C, not computing them 
efficiently.) The differentiability is clear since Vi{C) is analytic, and the bound- 
edness is proved in the next lemma. We conclude that the number of points of 
height < i? on C is ff4/(«+3)_ 

Since i can be taken arbitrarily large, our 

theorem follows. 

The existence of an upper bound on the size of the intersection of any hyper- 
plane with Vi{C) is a special case of the following lemma in complex analysis. 
Throughout the lemma and its proof we count zeros of an analytic function ac- 
cording to multiplicity, even though in the application to Thm.^ a multiple zero 
is no worse than a simple one. 

Lemma 2. Let E be an open subset of C and V a finite- dimensional vector 
space of analytic functions: E C. Then for any compact subset K <Z E there 
exists an integer n such that any nonzero f €z V has at most n zeros in K . 

Proof. Fix K . We shall say that a compact K' d E \s "good" if its boundary 
d{K') is rectifiable and its interior K' — d{K') contains K. Choose a good Ki, 
and define a norm on V by ||/|| = sup^g^/ 1/(^)1. Let Vi be the unit ball {/ e 
V : 11/11 = 1}. It is sufficient to prove the lemma for / e Vi. 

For each f G Vi choose a good Kf C K' such that / does not vanish on 
d{Kf). Let = inf^g9(x^) |/(2;)|, and let nj be the number of zeros of / in Kf. 
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By Rouche's theorem, if g €z V with ||/ — g\\ < Vf then g has at most Uf zeros 
m Kf, and a fortiori in K . Now Vi is compact and is covered by the open balls 
Bj of radius rj about / € Vi. Thus there is a finite subcover {-B/j}f£i. Then 
n := maxi rif^ is an upper bound for the number of zeros in K of any / e Vi, 
and thus of any nonzero F . 

To recover our result on hyperplane sections of T^i(C), take K = [a, 6], let E 
be a neighborhood of K on which / is analytic, and choose any analytic functions 
foifi:f2 on E such that f = {fo '■ fi '■ f2) on E. Then take for V the space 
of homogeneous polynomials of degree i in /o,/i,/2- If we understand / well 
enough to obtain for each i an effective bound n in Lemma ^ then the constants 
A, in Thm.| are effective too. 

With a little additional work Q can be replaced by an arbitrary number 
field F embedded in C, and C by f{K), where ii' C C is any compact subset 
and / is again an analytic map from a neighborhood of K to whose image is 
contained in no algebraic curve. 

A separate approach to bounding the number of rational points on curves 
was initiated in | |BP[ an d pursued further in Q and [HB2|. For example, Heath- 



Brown obtains in |HB2| bounds on the number of rational points on an algebraic 
plane curve that coincide with the time estimates in our Theorems ^ and ^ 
Moreover, our Thm.^ is contained in Thm.8], which asserts that for a num- 
ber field F with [F : Q] = n the number of i^-rational points of height < H 
on a transcendental analytic arc C is at most Ac^n.eH'^. Probably the methods 
of [BP,^ can also prove these results with arcs C replaced by compact tran- 



scendental curves f{K), and our bounds can also be made uniform in F given 
[i^ : Q]. There is clearly some overlap between the two approaches; for instance 
the Corollary preceding [p , T hm.8] is the same as our Lemma H, but proved 
using the determinants of |BP,^. What is not clear, but intriguing, is whether 



those determinantal methods and our lattice-reduction technique can ultimately 
be interpreted as facets of the same basic idea. 

All this also suggests the question of whether a transcendental arc can contain 
infinitely many rational points, of whatever height. I thank Michel Waldschmidt 
for pointing out that this question was already asked, and later answered affir- 



matively, by Weierstrass. See [M2, Chapter 3] for this and related results. 



3 The algorithm in practice 

In this section we report on the outcome of the application of our algorithm to 
various plane curves, and on some results suggested by our findings. We suppress 
details of the exphcit constants replacing each 0{- ■ ■) and <C; these details are 
of course crucial in practice, but are straightforward and not enlightening. In 
each case our curve has some rational points of inflection, and we make sure to 
truncate our curve enough to avoid the tangents at such points but not so much 
that we lose approximations near but not on those tangents. 

In general, for a plane curve given by a homogeneous equation P{X, Y,Z)=0 
of degree n, we associate to a rational point {x : y : z) near but not on the curve 
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the number 

nmax(|x|,|2/Uz|)"-V|P(a;,y,z)|, (9) 

which measures how close the point (x : y : z) is to the curve relative to the 
point's height. We insert the factor n so that we can reasonably compare ap- 
proximations for curves of different degrees. For instance, for the Fermat curve 
one expects that as x, y vary, the integer z" := 2;" + y" comes on average within 
jnz"~^ of the nearest n-th power of an integer, and thus that the smallest value 
of |z" — y" — x"| for z G [N,2N] is proportional to nz"~^. One could insert 
further factors to correct for the length and shape of our curve, but these factors 
are not significant for most of the curves we study. 

We noted already that the heuristics leading to formulas such as refer to 
"random" {x : y : z) near the curve, not for systematic families of approximations 
which may attain values of the ratio (^) larger or more often than expected. We 
again give an example for the Fermat curves, which were the subjects of most of 
our computations. One usually guesses that for each r there will be <C rlogiV 
triples {x, y, z) such that the ratio (|^) exceeds r. However, in the identity 

[t + 1)" - (t - 1)" = 2nr-i + 0(r-3), (10) 

we can make 2nt"^^ an n-th power by setting t — 2nw"; this yields 3> A^-'^/" 
triples with (||) bounded away from zero. We note the special cases n = 2, 3 of 
this identity: for n = 2, the 0(i"~^) error vanishes, and we recover a familiar 
parametrization of Pythagorean triples; for n = 3, the error is constant, and 
we can scale the identity to obtain the known family of solutions {x, y, z) = 
{6t^, — 1, 6t^ + 1) of z^ — y^ — x^ — 2. Returning to general n: in our searches 
we set the threshold on z"~^/|z" — y" — low enough to find all the examples 
coming from (^o|), as a check on the computation; but we chose a higher threshold 
for the tabulation of results so that our list is not dominated by this polynomial 
family. 

3.1 Fermat curves of degree > 3 

We implemented our algorithm to find small values of jz" — y" — a;"| with < x < 
y < z, 4 < n < 20, and z € [10^, 10^]. Since the threshold for "small" depends on 
the size of z, we wrote [10'^, 10^] as the union of 10 intervals [A^/2, TV] and treated 
each separately. We also used a direct search for z < 5000, using the overlap 
region [1000, 5000] as a check on the computation. We did not attempt to fine- 
tune the algorithm for efficiency, since we carried it out more as a demonstration 
project than a major computational undertaking. Thus we programmed the 
search in gp, using the built-in arithmetic and LLL lattice reduction. We estimate 
that transcribing the program to C, and replacing LLL by Minkowski reduction 
in R'^, would speed the computation by roughly an order of magnitude; of course 
a machine faster than a Sun Sparcstation Ultra 1 would help too. With a C 
program and a more powerful machine, it should be feasible to search the range 
n G [4, 20], z < 10^ in time on the order of a month. 
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The behavior of the run times and the counts of solutions with |2:"— y"— x"| <C 
z""^ seem broadly consistent with our heuristics, though we have not attempted 
a detailed statistical analysis. We tabulate the most striking examples, those with 

r := nz''-^l{z'^ - y" - a;") (11) 



of absolute value at least 4: 
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All decimal values of r are rounded to the nearest tenth. If for some integer 
A > 1 we have r > 4A^ then (Ax, Ay, \z) will also appear in the table provided 
\z < 10^; this happens for A = 2 at n = 4, 5, 7, 10, and for A = 3 at n = 5, 10. The 
first examples for n = 10 and particularly n = 5 (where 13^ + 16^ = 17^ + 12) are 
small and striking enough that one feels they must have been observed already, 
but I do not know a reference. On the other hand, the first two examples for 
n = 12 have been published, and in a most unlikely place: each appeared in a 
different episode of the popular animated cartoon The Simpsons. Perhaps the 
third example for n = 12, or an example with n = 7orn = 15, could be used if 
the cartoon repeats this theme once more; the relative error |z" — y" — x"|/z" in 
each case is between 1 and 2 parts in 10^^, as compared to 3 • 10~^° and 2 • 10~^^ 
for the two four-digit examples. . . 

Frivolity aside, one is struck by the pair of examples for n = 8. The values of 
r are far from the largest in the table, but they are almost equal and opposite, 
and involve nearly equal triples {x, y, z) for which z — y has the same small value 
of 12. This suggests that we are dealing with a polynomial family {x{t), y{t), z{t)) 
specialized at t = ±to- Indeed we quickly find that these are the cases t = ±3 of 

(32i^ + 6tf + (32i® + 7f = (32i^ + lOi)^ + 21 • 2'^H^° + 0(i^^), (12) 
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with r — i^/21 + 0{t^^). Thus arbitrarily large values of r occur, and indeed 
— — can be as small as 0{z^^/^) rather than the expected 0{z^). Trying 
to generalize the identity (12) further, we soon find that there are similar families 
for any exponent n such that 3n(n — 2) is a square: 

Theorem 5. Let n > \ be a positive integer. Then there exist polynomials 
x{t),y{t),z{t) € Z[t] of the form 

x{t)^Ct" + D, y[t) ^ Af^^^ + Bt, z{t)^Af'+^+B't (13) 

with A ^ 0, B' ^ B such that z^ — y"' — is a polynomial of degree at most 
n{n — 3), if and only if 3n(n — 2) is a square. In that case, there exist infinitely 
many integer triples (x, y, z) with < x < y < z such that z" — — <^ 

^(n^-3n)/(n+l) 

Proof. Let b, b' be the distinct rational numbers B/A, B' /A. Expand z" — at 
infinity: 

fin — 1 2 
z" - = nyl"(6' - 6) U" + + (14) 

^ (n^l^n_-2) ^^,2 ^ ^ 52)^n^-2„ ^ 0(^n^-3n. 

For this to be of the form (Ci" + Z?)" + 0(i"'-3n) ^^^^ ^^^^^ 

(n - 1) (^(6' + b)J = 2n ("-^)g("-^\ fe'^ + b'b + b^). (15) 

The discriminant of this quadratic equation in b' /b is 3n(ri — 2) times a square; 
thus (|5|) has nonzero rational solutions if and only if 3n{n — 2) is a square. 
Explicitly we find that b, b' are proportional to {n^ — 2n)/3 ± 1. 
Conversely, suppose — 2n = 3m^ for some integer m. Let 

z = A(r+i+c(TO+l)t), y = yl(t"+i +c(m- l)t). (16) 

Then 

t" + ^^cmj +0(r'-3"). (17) 

To make this (Ci" + £»)" + 0(t"'-3n) ^-^^-^ C\D eZwe now need only choose 
nonzero c € Z so that 2cn is an n-th power (e.g. take c = (2n)"~^), and 
then choose A so that n\Acm. Specializing t to sufficiently large integers in 
the resulting {x{t),y{t), z{t)) yields infinitely many integer triples {x,y,z) with 
< X < y < z such that z"^ - y"^ - < z^" -3n)/(n+i)^ ^s claimed. □ 

The smallest n > 3 such that — 2n = 3m^ is n = 8. There are infinitely 
many further examples, starting with 27, 98, 363, . . . , and parametrized by a 
Fermat-Pell equation. Dropping the constraint n > 3 yields the further cases 
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n = 2 and n — i. For n = 2 we again obtain a Pythagorean parametrization, 
this time with x, y, z multiphed by t\ for n = 3 we find 

(9^3 + 1)3 + (M4)3 „ (9^4 3^)3 ^ (18) 

one of infinitely many polynomial solutions of + _ = 1. 



3.2 The Fermat cubic 

Our algorithm applies to the Fermat cubic as it does to the Fermat curves of 
higher degree, but we treat it separately both because the heuristic analysis 
is subtler and because the problem of finding small values of \z^ — — 
has already attracted some attention. We noted that in general we expect the 
smallest values of |z" — y" — to be comparable with z"^^ Yor n = 3, we have 
^"■"3 = \^ and of course (given this case of Fermat's Last Theorem) |z3 — y3 — x3| 
can be no smaller than 1 for nonzero integers x, y, z. Moreover, z3 — y3_2;3 cannot 
be an arbitrary rational multiple of z"^3; Q^ly the discrete values ±1, ±2, . . . may 
arise. Thus, instead of a Diophantine inequality z" — y" — x" ^ z"~3, we have 
a family of Diophantine equations z^ — if^ — = d {d Tj), and new tools can 
bear on solving them or, failing that, describing their distribution of solutions. 
These equations have been investigated by various means since the beginning of 
the computer age; see Q for references to work up to about 1980 (some of which 



dates back to the 1950's), and [B2 CV,HBLR,KT£,PV| for more recent resuhs. 



As we shall see, the problem has been approached in several ways, some of which 
already improve on direct exhaustion over some 

iV^ values of (x,y). StiU, our 
new linear approximation method is better yet, both in heuristic theory — even 
though by factors smaller than our accustomed N/ \og^'^^^ N — and in practice, 
as evidenced by the computation of many new solutions. Our discussion here 
applies with almost no change to other "diagonal" cubics, such as x^ + + 2z^ 
which was also singled out in Prob. D5]; but we have not yet implemented a 
search for small values of |x3 + y3 -)- 2^3 1 beyond what has already been reported 
in the literature. 

For each nonzero d, the expected distribution of solutions of 

~3 y^-x^ (19) 



z 



involves not only considerations of size — i.e. of local behavior at the archimedean 
place of Q — but also on the behavior of z3 — y3 — at finite primes p: each p 
contributes a local factor fp{d) that is the ratio of the p-adic measure of the Zp- 
points of (^9|) to the average of that measure as d ranges over Zp. For instance, if 
any of those factors fp{d) vanishes, there can be no solutions at all. It is not hard 
to see that the only such local constraint is d ^ ±4 mod 9. Fo r such d, the re- 
sulting product over p was investigated by Heath-Brown ||HB1| . He showed that 
the product does not converge absolutely, but can nevertheless be analyzed and 
approximated numerically by comparing fp{d) with the factor at p of the Euler 

product for (Cq(^)(s)/C(s))'^ at s = 1, which differs from fp{d) by a factor of at 



18 Noam D. Elkies 



most 1 + 0{p ^/^). The product Y[p fp{d) is then seen to diverge to +00 if d is a 
cube and to converge to a positive limit when d is neither a cube nor congruent 



to ±4 mod 9. Heath-Brown thus conjectured in |HB1] that aU nonzero integers 
d ^ ±4 mod 9 occur as — — infinitely often. So far this is only known 
when d is either a cube or twice a cube, thanks to polynomial parametrizations, 
which the above heuristics do not try to account for. We have already exhibited 
polynomial solutions for d — 1,2. For many d^ A mod 9 which are neither cubes 
nor twice cubes , not a single solution is known for z^ — y'^ — x^ = d. Heath-Brown 



observes HB1 | that this is not surprising, because for many of these d the ex- 
pected number of solutions with z G [iV, lO^iV] is positive but smaller than 1. 
Guy lists the cases with d < 10'^ which were open as of 1980, and while the 
list is now shorter the question of which integers are the sums of three cubes 
is not yet settled even in that range. For instance, the case d = SO was open 
until 1999, and had been the smallest open case for several decades. 

We have noted already that a direct search finds all small \z^ — x^ — y^\ with 
z < in time iV^ log'^'-^-'iV. There have been several improvements on this, all 



obtained by rewriting the equation (|19|) as 

x^ + d = z^ -y^ ^ {z-y){z'^ + yz + y'^). (20) 

Once a;'^ -f d is factored, which takes heuristic time iV°(^\ all solutions of ( p0| ) 
can be found by trying each factor of x^ + d for z — y. Given the value of d, this 
takes time only iV^+°'^^^. In addition to dealing with only one d at a time, this 
method has the disadvantage that the N°^^'> time required to factor x'^+d, though 
subexponential, is still considerable. The advantage of this method is that it finds 
all solutions with x < N, while y, z may be considerably larger, of order up to 



^3/2 y[Qj^y Qf ^jig jjg-y^f solutions found in KTS | are of this type, with y, z large 
but z — y very small. Heath-Brown observed that, again given d, the factorization 
of x^ + d can be simplified by a precomputation in Z[syd], though the complexity 
of the precomputation depends unpredictably on d via the arithmetic of the 



number field Q(v/d); this approach was implemented in [HBLR|. Note that in 
effect these methods find rational points near the Fermat cubic that are close 
to the tangents to the curve at its inflection points — the same tangents that 
demand special care in our algorithm. A further variation which we suggested 
in 1996 is to use the factorization 

z^ - d^ x^ + y-^ = (x + y){x^ + xy + y^) (21) 

as follows: fix x+y, solve for z mod x+y, and try each of the resulting values of z. 
Here we only find solutions with z, not x, bounded by N, but the advantage is 
that factoring costs are greatly diminished. To find all cube roots of d mod x + y 
requires factoring x + y, a, number of size N rather than N^; and with enough 
space to set up a sieve the factorization can be avoided entirely. In 1999, Eric 
Pine, Kim Yarbrough, Wayne Tarrant and Michael Beck, all graduate students 
at the University of Georgia, took up this suggestion, choosing d = 30, and found 
the first solution: 

30 = 2220422932^ - 283059965^ - 2218888517^ (22) 
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We announced our new algorithm in the same 1996 posting to the NMBRTHRY 
maihng hst, together with results of a search for solutions with z < 10^ and 
|d| < 10'^. We did our search in gp, making our computation easy to program 
(since gp aheady provides muhiprecision arithmetic and lattice reduction) but 
far from optimally efficient. In 1999, unaware of the work of the Georgia group, 
we asked Dan J. Bernstein for an efficient implementation. He soon wrote a C 
program that found all solutions with z < 3 ■ 10^ and \d\ < 10'*, including ( p2[ ) 
and many others. Several values of d had not been previously represented as 
the sum of three cubes. Detailed results and analysis will appear elsewhere. As 
usual, since we are interested in small d, not all d <^ N, the improvement by a 
factor iV^/^^ should apply here as well to find all cases of \z^ — — \ <C z^^^^ 
with z < N, but we have not attempted to implement such a computation. 



3.3 Miscellaneous examples 

Trinomial units. One sometimes sees in Olympiad-style mathematics contests 
the question "Is z^/'^ greater or smaller than x^/'^ + y^^^l" for some specific 
positive integers x,y,z. Of course this is a challenge only when the sign of the 
difference := z^^^ — {x^^^ + y^^^) cannot be determined by inspection. In 
some cases the question be settled by applying classical inequalities; for instance 
if a > 6 > then (a + 6)^/'^ + (a — b)^/'^ < 2a^^^ by convexity of the cube root. 
The general solution is to compute the norm of u^/z^^^, an algebraic number of 
degree 9 none of whose other conjugates is real unless x — y. We find that 
has the same sign as 

'N[x,y,z):={z-y-xf ~27xyz. (23) 

Moreover, given the size of x, y, z, the smaller N(j:, y, z) is, the nearer will be 
to 0. In particular, we would like to have N(a:, y, z) — ±1, which would make the 
algebraic integer W3 a unit. Thus again we seek rational points close to a plane 
cubic curve, here N(a;, y, z) — 0. This time the curve is rational: by construction, 
it is parametrized by (a; : ?; : z) = (t'^ : (1 — t)'^ : 1). It is thus not smooth, but its 
only singularity is the isolated point {x : y : z) = {1 : —\ : —\) (geometrically a 
node with complex conjugate tangents), which does not affect our algorithm. The 
three rational points of inflection at xyz = z — y — x^Odo affect our algorithm, 
but fortunately we are not interested in the points on their tangent lines, since 
those are the points with xyz = 0. We thus restrict our attention to the portion 
of the curve with x/z,y/z > l/N, i.e. with t > iV~^/^ and 1 - i > N^^/^ in 
the rational parametrization. This takes us far enough from the inflection points 
that they cause us no difficulty. 

The situation is now much the same as for z^ — y'^ — x^ ^ d. We expect the 
number of solutions of l>i{x,y, z) = d oi height up to N to be proportional to 
log times a product of local factors Qpid). The only local factor that can vanish 
is gz{d), which is nonzero if and only if 9|d or d = ±1 mod 9. We henceforth 
assume that d is in one of these congruence classes. We can then check whether 
n„5p converges by comparing it with the L-series of the projective cubic surface 
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T^{x,y,z) — dt^ . This in turn depends on the Galois structure of the Neron- 
Severi group of the surface, which can be determined from the action of Galois 



on the lines on that cubic surface, as explained in |W1|. We must be careful 
here because, unlike + + = dt^, the surfaces N(x,j/, z) = dt^ are not 
smooth: each has an A2 singularity at (x : y : z : i) = (1 : — 1 : — 1 : 0). Thus 
each has, not 27 lines as usual, but 15, of which 6 go through the singularity; 



see [BW|. Explicitly, these are the preimages under the projection to {x : y : z) 
of the three coordinate axes and the two tangents to the curve at (1 : —1 : —1). 
We conclude that, as with ([l9|), Y[p9pi^) converges unless d is a cube. So we 
expect the number of unparametrized solutions of height < TV to grow as log N, 
except when c? is a cube, when it should grow faster, albeit still as a power of 
log N — perhaps log'^ N, by analogy with Manin's conjecture for cubic surfaces. 



Unlike the case of ( 19), we know of no solutions of N(x, y,z) = ±1 in noncon- 
stant polynomials x,y, z G Z[i], other than the trivial ones with xyz — 0. Never- 
theless we can find infinitely many nontrivial integer solutions parametrized by 
Fermat-Pell equations, and thus show that the number of solutions of height < N 
is ^ logA^. There are several ways to do this. In 1982 we found a somewhat 
complicated route to such a parametrization, obtaining a family of solutions 
starting with N(16948, 31226, 186919) = -1. The details may be found in the 



pages of |CM|. Many years later, we observed that a simpler approach is to factor 



N(2;, y, z) = ±1 as 

27xyz ^ {z-y~ xf ^1 ^ {z-y- x^\)[{z-y- xY ±{z-y - x) + l\. (24) 

For each r e Q*, we obtain a conic curve Cr by setting (z — y—x^l) — rx in (^). 
This can be viewed geometrically as follows: the affine surface N(a;,?/, z) = ±1 
contains the line x = {z — y — x ^ 1) = 0; thus the intersection of the surface 
with any plane (z — y — a; =F 1) ^ rx containing that line is the union of the line 
and some residual conic, which is our Cr- Likewise we could start from the line 
z = (z — 1/ — x=f1) = and intersect it with a variable plane {z — y — x^V) = rz. 
For many choices of r, one of these conies is a hyperbola with infinitely many 
integral points parametrized by a Fermat-Pell equation. 

In retrospect this approach to N(a;,y, z) = ±1, in which we fiber an affine 
surface by conies that may be regarded as principal homogeneous spaces for 



Fermat-Pell equations, seems a remarkable premonition of our later analysis | E1 | 
of the projective quartic surface + B'^ + = via a fibration by genus-1 
curves (principal homogeneous spaces for elliptic curves). In both cases the ap- 
proach finds infinitely many solutions but docs not readily lend itself to effi- 
ciently finding all solutions of height < N . Again a later computation found 
that the solution that was discovered first, because it lies on the first fiber that 
could contain a solution, is not the one of smallest height. We used our algo- 
rithm to find all small values of N(a:;,2/, z) with < x,y,z < 10^. We found 
that the smallest solution of N{x,y,z) = ±1 is (14,84,313) of norm -fl, fol- 
lowed by (6818, 4996, 46879), (20388, 4881, 86830), and (2742, 32540, 96843) each 
of norm -1, the known (16948,31226,186919), and (3408,182899,370338) of 
norm -|-1, with no further solutions up to 10^. We also found several primitive 
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solutions of N(x, y, z) — ±8 and a few sporadic examples with d small but not 
a cube, which could not have been obtained at all using the factorization trick; 
the smallest of these are 

N(204, 115327, 162434) = 17, N(650, 1425, 7899) = 26. (25) 

The N(a;,y,z) = 17 solution yields a disappointingly large value of M3 because 
the conjugates z^/^ — y^^^ — g±27rj/3^i/3 ^^.g smaller than usual. An unexpected 



result — since the identity ( |10[) cannot be used with exponents < 1 — was 
a polynomial solution of N(x,?/,z) — 108, namely {A,y{t),—y{—l — t)) where 
y{t) = At'^ — 6t + 3. We can write this symmetrically as {8,g{t), —g{—t)) where 
g{t) = 8t^ — 12t — 6t + 11, a cubic polynomial determined up to scaling by the 
condition that the Laurent expansion at infinity of (.g(t))^/^ have vanishing t~'^ 
and t~'^ terms. In this form, N(a;,y, 2;) is the larger constant 864 = 2^108, but 
with the bonus that x is a cube so U3 involves one fewer surd; for instance, taking 
t = 7 we find that v^3279 is smaller than 2 + 5\/T7 by less than 3.75- 10"'^. In this 



family, as with the first example in (25), U3 is of order z~^, not z~^^^, because 
two of the conjugates of U3 are 0(1). 

A similar investigation of M4 :— z^/* — (a;^/'* + y^^^) was not as productive, 
perhaps not surprisingly since there are no arithmetic reasons to expect many 
nonzero small examples. For the record, the smallest z^^/^|u4| value found for 
z < 10^ was 0.365+ for (.T,y,z) = (241,691,6759), while the smallest |u4| in 
that range was (3.23-) • IQ-^^ for (37792,36109,591093). 

The TT-th Fermat curve. To illustrate our algorithm also for non-algebraic 
curves, we chose to apply it to the Fermat curve of exponent tt. Since tt exceeds 
3, but only slightly, we expected that \z'^ — y'^ — x'^\ achieves a global minimum 
over all x, y, z with < x < y < z but that the minimum might involve numbers 
of several digits. We were rewarded with the example 

2063^^ + 8093^^ - 8128'' = 0.019369- = 8128''~V(184.75+), (26) 

which seems likely to be the minimum of \z'^ ~ y'^ — x'^\ over all positive integers 
X, y, z. At any rate, according to our computations it is the smallest with z < 10^. 
The ratio 184.75+ is also the largest in that range, though there is also 

1198"^ + 4628^^ - 4649^^ -(0.04949+) = -4649^^7(66.794+). (27) 



It will probably be a long time before the question of the minimality of (26) is 
settled; a weaker but still intractable conjecture is that there are only finitely 
many integer solutions of jz'^ — y'' — < 1. 

The Klein quartic. All our examples so far were Fermat curves, even 
though some had unusual exponents 1/3, 1/4, tt. Probably the best-known pro- 
jective plane curve that is not a Fermat curve is the Klein quartic K{X, Y, Z) = 0, 
where 

K{X, Y, Z) X^Y + Y^Z + Z^X. (28) 

We used our algorithm to search for small values of K{x,y,z). By symmetry 
we may assume max(x,y, z) = z. We are then seeking rational points near 
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a segment of a plane curve with a single inflection point, at a; = y = 0. 
The tangent a; = at this point accounts for the obvious family (0,1, z) with 
K{x, y, z) = z. Our computation up to height 10® quickly revealed a less obvious 
family, K{l,—t^,t^) — —t'^, with K(x,y,z) growing even more slowly than the 
height. As usual we also found sporadic examples, though here (as with several 
other cases we have already seen such as the Fermat quintic) the best ones are 
small enough that our algorithm was not needed to locate them: 

K(1421, -1057, 1501) = -49, 
i^(7211, -8381, 11010) = -121, (29) 
i\:(-1550, 11817, 32615) = 245, 

with z/\K{x,y,z)\ = 30.6, 91.0, 133.1 respectively. The largest z/\K{x,y,z)\ 
found with z £ [10^, 10®] off the singular cubic y^ + x'^z = was 6.756+, from 
i^(-7871, 175577, 829244) = 122741. 



4 Hall's conjecture 

4.1 Reviev^r of Hall's conjecture 

By Hall's conjecture me mean the following assertion: if x, y are positive integers 
such that 

k:=x^- y^ (30) 
is nonzero (equivalently, such that (x,y) ^ {t'^,t^)), then 

|fc| », (31) 

(While this accords with current usagCjit is not exactly what Hall originally 
wrote: as F. Beukers points out. Hall Q conjectured |fc| ^ x^^^, a stronger 
statement which is probably false — the usual heuristic suggests that there are 
at least {5 + o{l)) logX c ases of < |fc| < S^/x with x < X — but unlikely to be 



soon disproved. See also [ BCHS| for the early history of this conjecture.) Among 



several equivalent forms of (pll) we note the conjecture that the discriminant 
of an elliptic curve over Q in its standard minimal form has absolute value 
|a4|"'^/^^'^. Known lower bounds on |fc| are much weaker than (^l]). By Siegel's 
theorem on the finiteness of integer points on elliptic curves, each nonzero fc g Z 
occurs finitely many times a,s x^ — y^, so |fc| — > oo as a; ^ oo. Siegel's proof is 
ineffective and thus says nothing about how fast |fc| must grow with x. Starting 
with Baker's method, effective bounds have become available, but they are still 
very weak. For instance, it is not yet possible to prove for any 9 > that 
\k\ > a;^ 

Hall's conjecture is now recognized as an important special case of the Masser- 
Oesterle ABC conjecture Q (see also p|]) . Th us its analogue over function fields 



is known to be true by Mason's theorem |M3|. In the special case of Hall's con- 
jecture for polynomials x{t),y(t), the fact that x^ — y^ is either zero or has degree 
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> ^ deg(a;) was proved some twenty years earlier by Davenport [D2] in response 
to a question raised in | BCHS| |. As in [E2] it follows that the conjecture cannot be 



disproved by a polynomial parametrization, and indeed in any polynomial fam- 
ily {x{t),y{t)\t g Z) we must have k ^ with 6 > 1/2. One does better with 
solutions parametrized by Fermat-Pell equations, i.e. x,y £ Z[t, ^/af^ + bt + c] 
for some a,b,c £ Z such that v? = at^ -\-bt + c has infinitely many solutions. The 
function field Q(t, V at'^ -\-bt + c) is then still rational, so the Davenport-Mason 
inequality again holds, but since now there are two places at infinity one can 
have x^ — of degree exactly ^ deg(x), and thus attain 6 = 1/2. The existence 
of a single such family (exhibited below) shows that the exponent in (^l]) cannot 
be raised above 1/2. The fact that one cannot reduce 9 below 1/2 in this way was 
again observed in in the more general context of the ABC conjecture. This 
fact lends some credence to that conjecture, and thus to its special case (|3l|); 
this contrasts with the situation for |z" — y" — where there is no reason why 
some polynomial or Pell family might not do better than the z"^-^^"^ expected 
by probabilistic heuristics, and indeed we found such families for some choices 
of n. 

We next digress to say some more on polynomial and Fermat-Pell families 
that attain the Davenport-Mason bound, both because they are of independent 
interest and because families of both kinds appear in our numerical results. In 
either case x^ — = fc is an identity in a genus-zero function field, namely 
Q(t) in the polynomial case and Q(i, ^/ at^ + bt + c) in the Fermat-Pell case. Let 
X, y have degrees 2m, 3m respectively, and suppose k has the smallest degree 
possible, i.e. m -f 1 in the polynomial case and m for Fermat-Pell. Then / := 
x^ jy^ is a rational function of degree 6m or 12m on ramified only above 
0, 1, cxD. The Riemann existence theorem provides infinitely many such functions 



/ = x^ jy^ in C(t); this answers the first part of the question raised in [BCHS, 
p. 68]. The second part concerns solutions over R, and can probably be settled 
by adding data on complex conjugation to the branched covering. But we are 
most interested in the third part of the question, in which / must have rational 
coefficients. Given any one (a;(i), y{t)), we may trivially obtain others of the form 
{x' ,y') = {X^x{t'), X^y{t')) where t' — at + bin. the polynomial case, and t' e Q[t] 

with V at'^ + bt' + cj \J at'^ + bt + c £ Q[t] in the Fermat-Pell case. If we regard 
such [x' ,y') and {x,y) as equivalent, only a handful of examples over Q are 
known, and there may well be no others. We next list representatives of the 
known examples. 

In the polynomial case, all known examples have m < 5. For m = 1, trans- 
lation and scaling brings any quadratic x{t) to the form t^ + 2a, and then 
y = t^ + Sat and k = Sa^t'^ + 8a^. Necessarily a ^_0, and all such examples 
are "twists" of each other, becoming isomorphic over Q if not over Q. Note that 
x^/y"^ is a degree-3 function of t"^ with a triple zero. This function occurs for 
instance as the cover of the modular curve X(l) by Xo(2). For m = 2 we again 
find that the solution is imique up to twist: x = t'^ + Aat, y = t^ + 6at^ + 6a^, and 
k = —8a^t^ — 36a''. This time x^/j/^ is a degree-4 function of t^, whose ramifica- 
tion identifies it with the modular cover Xo(3) X(l). Birch found examples of 
(x, y, k) with m = 3, 5 and included them in a 29. ix. 1961 letter to Chowla; they 



24 Noam D. Elkies 



are reported in | BCHS ] : 



36**^ + 24*" + lOi^ + 1, 216t^ + 216i^ 



and 



(32) 



+ 15*^ + 12), 



L_ 

27 



+ 



t^' + 4r + St" 



+ 



^ + lAt^ + 27 



2 ' 108 

'(33) 

These yield integer solutions if t is a multiple of 4 in ( p2[ ) or congruent to 3 mod 6 
in (^3|) . As noted in | BCHS | , the second example provides infinitely many integer 
solutions of \x'^ — ^ x'^^^] moreover, for this choice of twist, the leading coef- 



ficient of k{t) is small enough that 



y \ is even a respectably small multiple 



of x^/'^ for the first few specializations of t. The maps / — x^ jy^ associated with 
Birch's polynomials both have interesting Galois groups. For (|3^), / is a degree-9 
function of whose Galois group is PSL2(F8) over C(i^) and Aut(PSL2(F8)) 
over Q(t^); the Galois closure is the Fricke-Macbeath curve For (H), 

/ is a degree-10 function of whose Galois group is PSL2(F9). These groups 
and curves do not arise in connection with classical modular curves, but they 
can be identified with certain Shimura modular curves, most naturally those 
associated with with the (2, 3, 7) and (2, 3, 8) arithmetic triangle groups (see for 
instance ]l|,p5t). Hall p. 185] gives an example with to = 4: 

X = + + 21t^ + 50t^ + mt'^ + 114*3 ^ ;^Q9^2 _^ ^ 28); (34) 



In August 1998 I announced a new example with to 
be explained elsewhere): 



5 (its computation will 



X = t^''+2tV33t* + 12t''+378t®-336t^+2862t*-2652t^+14397t^-9922*+18553. (35) 

In both cases (as with all the other (x,y,k) examples), y is obtained by trun- 
cating the Laurent expansion at infinity of x'^/^ after the constant term. Neither 



4h nor (35) yields an interesting Galois group: the Galois groups oix'^/y^ are 



Alt24 and Symgp respectively. While (|35|), like (p3D, must yield infinitely many 
integer solutions of jx^ — 2/^| ^ x^^^, the leading coefficient of k in ( |35| ) makes 
the implied constant much larger, and none of these solutions will appear in our 
list of small values of 



X — y \. The question, raised in |BCHS|, whether there 



are any x,y,k € Q[i] of degrees 2to, 3to, to -I- 1 with m > 5, remains unsolved. 

For Fermat-Pell families, the list is even shorter: all known examples are 
equivalent, and come from the identity 



{t^ + lot + 5f - {t^ + 22* + 12b){t^ + 4* - 1) 



1728*. 



(36) 



Here j/ is a multiple of V a<^ + bt ^ c, so / factors as a map of degree 6 composed 



with the double cover of Q(*) by Q(*, ^/at'^ +ht + c). We noted in |EJ, p. 49] that 
the resulting degree-6 map / = x^ jy^ : — > is the cover Xo(5) — > X(l) 
of classical modular curves. Thus the elliptic curves of low discriminant coming 
from the identity (Bq) all admit a rational 5-isogeny. Each Fermat-Pell family 
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obtained from (36) by specifying the class of + 22i + 125 mod Q*^ yields 



k - Ca;i/2 for some nonzero C. The smallest such C is 5-^/254 ^ j6598 . 



obtained by Danilov |D1] by substituting 125(2t — 1) for t in ( p6[ ) and dividing 
by 203; 

(55i2-3000i + 719)^-(53t2-ii4t + 26)(5^t2_53j^23t+3781)2 = 27(2t-l). (37) 

The factor b^t^ — 114i + 26 is a square for t = —5, and thus for infinitely many t. 
The first case t = — 5 of this yields the elliptic curve of discriminant —11 labeled 
11-A2(C) in Cremona's table ]C2| ; it is known that the isogeny class of this 
curve provides the examples with minimal conductor of a rational 5-isogeny, 
and indeed of an elliptic curve over Q. 

4.2 The new algorithm 

To obtain numerical data with which to compare Hall's conjecture, we want to 
find all small nonzero values of — | with x, y S Z and a; < X. So that we can 
compare our algorithm with other approaches we briefly review previous work 
on this problem. 

The most direct approach is to simply compute for each x < X the integer y 
closest to x^/'^. Since x^/'^ varies smoothly with x, this can be done quite effi- 
ciently, but clearly must take at least time proportional to X. This is essentially 
what Hall did in with X = 1 ■ 10®; some three decades later, faster comput- 
ers make larger X feasible, and indeed Frits Beukers reports in an Aug. 1998 
e-mail that he performed such a computation for X = 10^^. But this is probably 
close to the practical limit with today's technology, and at any rate this direct 
approach is superseded by the X^/"^ \o^^^^ X algorithm described below. 



A fundamentally different approach is taken in [GPZ]: for each nonzero k S 
[—K^K\, investigate the arithmetic of the elliptic curve Ek : — x^ — k, and 
use effect ive bounds on integral points to find all integer solutions of x^ — y^ = k. 
In ||GPZ[ , Gebel, Petho, and Zimmer did most of this work for K = 10^, except 
for a few values of k, for which they could n ot fi nd a generator for ^^^(Q); 



Wildanger later showed in his doctoral thesis [W2| that none of these Ek has 
an integral point, thus completing the computation of integer solutions of < 
\x^ — 2/^1 < 10^. It is not clear even heuristically how this method compares 
with other approaches. It is the only approach used thus far that will provably 
find all solutions with |fc| < K. (The recent proof of the modularity conjecture 
means that Cremona's algorithms []C^ yield another such approach, but to my 
knowledge it has not been used to solve x^~y^ — k.) Assuming Hall's conjecture, 
\k\ < if is equivalent to x <Ce if^^'^, but this begs the question of the constant 
implied in Neither do we know how to estimate the average work required 
to find all integer points on a curve E^. It may be reasonable to guess that this 
average work is proportional to K'^~^°''^^ for some c > 0. (This estimate certainly 
holds for Cremona's algorithms.) The total work would then be K'^+^+°'^'^) , Under 
Hall's conjecture, this is equivalent to so strictly worse (modulo an 

unknown implied constant) than our X^^^ log'~^^^^ X algorithm, though perhaps 
better than a direct search, depending on whether c < 1. 
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We noted already that the direct search can exploit the smoothness of the 
function x^^"^ . We can try to take further advantage of this by mimicking our ap- 
proach to rational approximation of curves: surround the segment x < X oi the 
semicubical parabola y = x^l"^ by a union of parallelograms each of area 0(1), 
and use lattice reduction to quickly find all integer points in each parallelogram. 
This does give an asymptotic improvement, though a small one: the parallelo- 
gram containing a point (x^x^l"^) has length ^ x^l^ ^ so the computational cost 
is reduced by at most X^/^ to X^I^Xog^^^'^ X . 

We reduce the exponent of X from 1 or 5/6 to 1/2 by a more radical reorga- 
nization of the computation that lets us apply lattice reduction more efficiently. 
More generally, for each positive c G Q we can find all cases of < \cx^ — y'^\ ^ x 
in time OciX^^"^ log'^^^'^-'X). All choices of c are essentially equivalent: we get from 
one to the other by scaling a;, y and imposing congruence conditions on them. 
The most convenient choice of c turns out to be 4/3. We thus show how to solve 
< y^x^ — 3j/^| <C x\ the cases relevant to Hall's conjecture are those with 3|a; 
and 6|y, when (4x3 - 3?/2)/l08 = (xjif - {y/&f. 

We begin as in by approximating x, y by (multiples of) a square and a 
cube. Any positive integer x may be written uniquely as 

a; = 3C^-|-r; with r;,C e Z,C > 0, 7/ e (-3C,3C]. (38) 

Then 

{AxVif^ = + 2^^^ + if " ^ (?) + 
We thus write 

y = 6C^ + 377C + i (40) 

with ^ ^ C- More precisely, if 

V = K (41) 
Then (3 e (-3, 3], and |4a;3 - 3j/2| < x if and only if 

e=|^-^/?' + 0(l/C). (42) 

At this point, Hall imposes the assumption (3 <^ We allow an arbitrary 

(3 G (—3,3] and approximate it within 0(X^^/^) by one of 0{X^/^) evenly 
spaced points in that interval. Suppose, then, that b is one of those points. We 
approximate (^) by a linear combination of ?7 — bC,, and 1: 

62 b 53 fe2 5 53 

^ = + 2 (r? - feC) - + 0(1/C) = - jC + 2^ - 7^ + 0(1/C). (43) 

We now assume that C ^ X^/'^, for instance by requiring that x > X/A; repeating 
the computation with X replaced by X/A, X/IQ, X/6A,. . . will then cover the 
entire range x < X, and if we can cover {X/A, X] in time 0{x^^^ log~^^^^ X) then 
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the same is true of Under the assumption x G {X/4:,X], we have the 

following constraints on ^, rj, C: 

C<Xi/2, 77-6C<l, (44) 

and 

e + jC-^^y+^«X"V^ (45) 

We are thus in a familiar situation: we seek all the integral points in 0{X^^^) 
parallelepipeds, each of volume 0(1). The term in ( ^5|) means that the 

parallelepipeds are no longer centered at the origin, but this causes no diffi- 
culty — indeed we already dealt with off-center parallelepipeds in the practi- 
cal implementation of our algorithm for finding rational points near curves. So 
again we linearly transform each parallelepiped to a cube and obtain a lattice 
reduction problem; if these lattices were randomly distributed among three- 
dimensional lattices, we would almost certainly have only 0{X^^^) points to try, 
and would thus find all solutions of < \4:X^ — 3y^| ^ x with x < X in time 

0(Xl/2l0g°(l)X). 

In fact it turns out that in this case our lattices are not equidistributed: they 
all lie in a 2-dimensional subspace of the 5-dimensional moduli space of lattices 
in R''. This gives rise to both a minor annoyance and a major advantage. The 
bad news is that we cannot expect our lattices to have on average 0(1) vectors of 
norm <C 1; but this annoyance is minor because the actual average is proportional 
to logX and thus can be absorbed into the log^'^^^X factor. The good news is 
that we understand our special lattices well enough to actually prove results that 
are only heuristic for rational points near curves. 

The key is that in each case our lattice is a symmetric square of a lattice 
in R2. By this we mean the following. Recall that the symmetric square of a 
2-dimensional vector space V is the 3-dimensional vector space Sym^V^ consisting 
of symmetric tensors inVi^V. Since Sym^V is defined naturally in terms of V, 
any linear transformation of V yields a linear transformation of Sym^V. We thus 
have a homomorphism Sym^ : GL2 — *■ GL3 . To give this map explicitly we choose 
a basis {61,62) for V, and use the basis (ei (g) ei, (ei (8) 62 -I- 62 (8" ei)/2, 62 (8) 62) 
for Sym^V. We then calculate that 

/ \ / pq 9^ \ 
Sym^ (P'^^] = {2prps + qr2qs\. (46) 
V / y ^2 g2 J 

Over any field, Sym^(SL2) is contained in the subgroup of SL3 preserving the 
discriminant form 4aia3 — on Sym^{V)\ if we worked over an algebraically 
closed field, that subgroup would coincide with Sym^(SL2). Now (|4^ , ^ ) mean 
that the column vector v = (^,?7, C) S Z'^ satisfies ||Mf,-y — Ub\\ <^ 1 where 
Ub := (0,0,-6V72) and 

Mb-.^l 1 1 - Sym^ ( J^,, ^,7'^' ) . (47) 
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This is why we went after 4x^ — 3t/^ rather than pursuing — directly: an 
analogous approach to — would yield a matrix that is still a symmet- 
ric square but with respect to a different basis, requiring a definition of Sym^ 
with fractional coefficients and complicating the lattice reduction. Note that the 
quadratic form 4fC~ ^7^ preserved by Mb e Sym^(SL2) is already visible in (^^. 

Our algorithm, then, is as follows. For each of our 0{X^/^) choices of b, 
calculate the matrix 

with Alb = Sym^ = Nb- Use lattice reduction to find a matrix Kb e GL2(Z) such 
that NbKb is as small as possible. Then 

M'b := Syia^{NbKb) = Sym^A^b Sym^ Kb = Ah Syia^ Kb- (49) 

is small too. Let Lb = Sym^ Kb £ GL3(Z). Then MbV = M'^L'^^v. Find a box 
containing all w S such that — Mf,|| ^ 1. For each w in the box, 

compute V = LbW and check whether the resulting x, y satisfy x £ {X/4, X] and 
< \4:X^ — 3y^\ <C x; if they do, output x (and check whether 3|a; and 6|y to 
determine whether this solution also yields a small value of a;"^ — y^). This is easier 
than our usual algorithm because we are reducing a lattice in rather than 
R^, which in our case amounts to calculating the continued fraction of b/ X^/"^. 
Moreover, the computational cost of the algorithm can be bounded rigorously: 
Ml will only be large if bjX^I"^ is close to a rational number with numerator 
and denominator <C X^l^ , and the effect of such a close rational approximation 
is easy to determine. Summing over all rationals of height ^ X^l^ we find that 
the total number of candidate vectors w is ^ X^I'^XogX ^ and thus that the 
computation takes time 0{X^I^ log'^'-^^X) as claimed. 

Note that the X^I'^XogX bound also has the following consequence: there 
are <C X^/^ logX solutions of — <^x with x <X. Moreover, if C is large 
enough, we can deduce from this analysis that there are ^ X^l"^ solutions of 
< |a;'^ — y^l < Cx with x £ [X/2,X]. More generally, we show that for each 
positive c S Q there exists C such that for each r £ R/Z and d > 1 there are at 
most CdX^/^logX solutions of \{cx^Y^^ - {y + r)\ < dX^^/"^ with x,y £Z and 
X < X\ and, given c as above and any £ [0, 1), there exists Co such that for 
any r £ R/Z there are » X^/^ solutions of Kca;^)^/^ -{y + r)\ < CqX^^I"^ with 
X, y G Z and x £ \QX, X\. The constants C, Cq depend effectively on c, Q. These 
results improve considerably on results in this direction available from general 
exponential-sum techniques for proving uniform distribution mod 1. The detailed 
proofs of our claims in this paragraph will appear elsewhere. 

4.3 Numerical results 

We have implemented our algorithm in a C program using 64-bit integer arith- 
metic, again replacing each 0(- • •) and <C by explicit bounds, and searched for 
all solutions of < \4x^ - Sy^l < 200x^/2 with 4 • 10^ < x < 3 • 10^^. The range 
X < 10^° was covered by a direct search, the overlap [4-10^, 10^°] being used as a 
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check on the computation. The code was processed with an optimizing compiler 
and ran for three weeks during the summer of 1998 on a Sun Sparcstation Ul- 
tra 1. As a corollary we obtained all cases of < |a;'^ — < \\fx with x < 10^^. 
(With currently available hardware the same computation could easily finish in 
a few days; with parallelization it should be feasible to reach 10^^ at least.) The 
next table lists, for each of the 25 solutions of < jx'^ — < ^y/x, the values of 



y 



^, X, and r 



a/2 



/\k\. We need not list y, which is always the integer 



nearest to a;'^/^. The explanation of the last two columns follows the table. 



# 


k 


X 


r 


GPZ? 


Comments 


1 


1641843 


5853886516781223 


46.60 




!! 


2 


30032270 


38115991067861271 


6.50 




I 


3 


-1090 


28187351 


4.87 


+ 




4 


-193234265 


810574762403977064 


4.66 






5 


-17 


5234 


4.26 


+ 


P(-3) 


6 


-225 


20114 


3.77 


+ 




7 


-24 


8158 


3.76 


+ 


P(3) 


8 


307 


939787 


3.16 


+ 




9 


207 


367806 


2.93 


+ 




10 


-28024 


3790689201 


2.20 


+ 




11 


-117073 


65589428378 


2.19 






12 


-4401169 


53197086958290 


1.66 






13 


105077952 


23415546067124892 


1.46 




* 


14 


-1 


2 


1.41 






15 


-497218657 


471477085999389882 


1.38 






16 


-14668 


384242766 


1.34 


+ 


P(-9) 


17 


-14857 


390620082 


1.33 


+ 


P(9) 


18 


-87002345 


12813608766102806 


1.30 






19 


2767769 


12438517260105 


1.27 






20 


-8569 


110781386 


1.23 


+ 




21 


5190544 


35495694227489 


1.15 






22 


-11492 


154319269 


1.08 


+ 




23 


-618 


421351 


1.05 


+ 




24 


548147655 


322001299796379844 


1.04 




D 


25 


-297 


93844 


1.03 


+ 


D 



The 



'GPZ" column indicates whether the solution was among the 13 listed 
. These are the solutions with 1 < |fc| < 10^. Presumably the solution 



in PPZH 

2^ — 3^ = — 1 is not on that list because the elliptic curve = x^ + 1 was already 
known to have rank so Gebel, Petho and Zimmer were not interested in it. 

The #1 row is a new record, improving the previous record r by a factor of 
almost 10, whence the notation "!!" . Even row #2, marked "!" , has r larger than 
the old record which is row ^ 3. Either of this suffices to refute Hall's comment 
[P, p. 175], repeated in | GPZ |, that ?■ < 5 seems to hold in all cases. 

*: Obtained from row #1 by scaling {x, y, k) to {2'^x, 2^y, 2^k). This reduces 
r by a factor of 32, but r = 46+ in row #1 is large enough that even r/32 still 
exceeds the threshold of our table. 
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P{t): Birch's polynomial family (^). This has r — 12/t + 0(t^*), so the only 
values of t = 3 mod 6 that appear on the r > 1 list are t = ±3 and ±9. Already 
in |BCH£ , p. 69] the specializations t ~ ±3 are noted as "striking special cases" 

D: The first two cases of Danilov's family (^7|). The appearance of the larger 
of these was a welcome check on our computation. 

Any threshold on r is of necessity arbitrary; the next solution has r just 
below our cutoff of 1: [x, k, r) = (16544006443618, 4090263, 0.9944 . . .). 
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